
(1) GENERAL INFORMATION: 



SEQUENCE LISTING 



(i) 



APPLICANT: Oon, Chong Jin 



Lim, Gek Keow 
Leong, Ai Lin 
Zhao, Yi 



Chen, Wei Ning 



TITLE OF INVENTION: A VACCINE- INDUCED HEPATITIS B VIRAL 
STRAIN AND USES THEREOF 



(iii) NUMBER OF SEQUENCES: 11 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Ladas & Parry 

(B) STREET: 26 West 61 Street 

(C) CITY: New York 

(D) STATE: New York 

(E) COUNTRY: USA 

(F) ZIP: 10023 

(V) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.30 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER : 09 / 7 19 , 53 3 

(B) FILING DATE: 13-DEC-2000 

(C) CLASSIFICATION: 43 5 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: PCT/SG98/00045 

(B) FILING DATE: 19-JUN-1998 

(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: Mass, Clifford J. 

(B) REGISTRATION NUMBER: 30,086 

(C) REFERENCE / DOCKET NUMBER : U-013 10 8 -9 

(ix) TELECOMMUNICATION INFORMATION: 
(A) TELEPHONE: (212) 708-1890 

(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3215 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

CTCCACCACT TTCCACCAAA CTCTTCAAGA TCCCAGAGTC AGGGCCCTGT ACTTTCCTGC 60 

TGGTGGCTCC AGTTCAGGAA CAGTGAGCCC TGCTCAGAAT ACTGTCTCTG C C AT ATCGTC 120 

AATCTTATCG AAGACTGGGG ACCCTGTACC GAACATGGAG AACATCGCAT CAGGACTCCT 180 

AGGACCCCTG CTCGTGTTAC AGGCGGGGTT TTTCTTGTTG ACAAAAATCC TCACAATACC 240 



GCAGAGTCTA GACTCGTGGT GGACTTCTCT CAATTTTCTA GGGGGAACAC CCGTGTGTCT 3 00 



TGGCCAAAAT TCGCAGTCCC AAATCTCCAG TCACTCACCA ACCTGTTGTC CTCCAATTTG 3 60 

TCCTGGTTAT CGCTGGATGT GTCTGCGGCG TTTTATCATC TTCCTCTGCA TCCTGCTGCT 420 

ATGCCTCATC TTCTTGTTGG TTCTTCTGGA CTATCAAGGT ATGTTGCCCG TTTGTCCTCT 480 

AATTCCAGGA TCAACAACAA CCAGCACCGG ACCATGCAAA ACCTGCACAA CTCCTGCTCA 540 

AGGAACCTCT ATGTTTCCCT CATGTTGCTG TACAAAACCT AC GG AC AG AA ACTGCACCTG 600 

TATTCCCATC CCATCATCTT GGGCTTTCGC AAAATACCTA TGGGAGTGGG CCTCAGTCCG 660 

TTTCTCTTGG CTCAGTTTAC TAGTGCCATT TGTTCAGTGG TTCGTAGGGC TTTCCCCCAC 72 0 

TGTCTGGCTT TCAGTTATAT GGATGATGTG GTTTTGGGGG CCAAGTCTGT ACAACATCTT 780 

GAGTCCCTTT ATGCCGCTGT T AC C AATTTT CTTTTGTCTT TGGGTATACA TTTAAACCCT -840 

CACAAAACAA AAAGATGGGG ATATTCCCTT AACTTCATGG GATATGTCAT TGGGAGTTGG 900 

GGCACATTGC CACAGGAACA TATTGTACAA AAAATCAAAA TGTGTTTTAG GAAACTTCCT 960 

GTAAACAGGC CTATTGATTG GAAAGTATGT CAACGAATTG TGGGTCTTTT GGGGTTTGCC 102 0 

GCCCCTTTCA CGCAATGTGG ATATCCTGCT TTAATGCCTT TATATGCATG TATACAAGCA 1080 

AAACAGGCTT TTACTTTCTC GCAAACTTAC AAGACCTTTC TAAGTAAACA GTATCTGAAC 1140 

CTTTACCCCG TTGCTCGGCA ACGCCCTGGT CTGTGCCAAG TGTTTGCTGA CGCAACCCCC 1200 

AC TGG TTGGG GCTTGGCCAT AGGCCATCAG CGCATGCGTG GAACCTTTGT GTCTCCTCTG 1260 

CCGATCCATA CTGCGGAACT CCTAGCCGCT TGTTTTGCTC GCAGCAGGTC TGGGGCAAAA 132 0 

CTCATCGGGA CTGACAATTC TGTCGTGCTC TCCCGCAAGT ATACATCATT TCCATGGCTG 1380 

CTAGGCTGTG CTGCCAACTG GATCCTGCGC GGGACGTCCT TTGTTTACGT CCCGTCGGCG 1440 

CTGAATCCCG CGGACGACCC CTCCCGGGGC CGCTTGGGGC TCTACCGCCC GCTTCTCCGC 1500 

CTGTTATACC GACCGACCAC GGGGCGCACC TCTCTTTACG CGGACTCCCC GTCTGTGCCT 1560 

TCTCATCTGC CGGACCGTGT GCACTTCGCT TCACCTCTGC ACGTCGCATG GAG AC C AC C G 162 0 

TGAACGCCCA CGGGAACCTG CCCAAGGTCT TGCATAAGAG GACTCTTGGA CTTTCAGCAA 1680 

TGTCAACGAC CGACCTTGAG GCATACTTCA AAGACTGTGT GTTTAATGAG TGGGAGGAGT 1740 

TGGGGGAGGA GGTTAGGTTA AAGGTCTTTG TACTAGGAGG CTGTAGGCAT AAATTGGTGT 1800 

GTTCACCATC ACCATGCAAC TTTTTCACCT CTGCCTAATC ATCTCATGTT CATGTCCTAC 1860 

TGTTCAAGCC TCCAAGCTGT GCCTTGGGTG GCTTTGGGGC ATGG AC AT TG ACCCGTATAA 1920 

AGAATTTGGA GCTTCTGTGG AGTTACTCTC TTTTTTGCCT TCTGACTTTT TTCCTTCTAT 1980 

TCGAGATCTC CTCGACACCG CCTCTGCTCT GTATCGGGAG GCCTTAGAGT CTCCGGAACA 204 0 

TTGTTCACCT CACCATACGG CACTCAGGCA AGCTATTCTG AGTTGGGG TG AGTTAATGAA 2100 

TCTAGCCACC TGGGTGGGAA GTAATTTGGA AGATCCAGCA TCCAGGGAAT TAGTAGTCAG 2160 

CTATGTCAAC GTTAATATGG GCCTAAAAAT CAGACAACTA TTGTGGTTTC ACATTTCCTG 2220 

TCTTACTTTT GGGAGAGAAA CTGTTCTTGA ATATTTGGTG TCTTTTGGAG TGTGGATTCG 2280 



CACTCCTCCT 


GCATATAGAC 


CACCAAATGC 


CCCTATCTTA 


TCAACACTTC 


CGGAAACTAC 


2340 


TGTTGTTAGA 


CGAAGAGGCA 


GGTCCCCTAG 


AAGAAGAACT 


CCCTCGCCTC 


GCAGACGAAG 


2400 


GTCTCAATCG 


CCGCGTCGCA 


GAAGATCTCA 


ATCTCGGGAA 


TCTCAATGTT 


AGTATTCCTT 


2460 


GGACACATAA 


GGTGGGAAAC 


TTTACGGGGC 


TTTATTCTTC 


TACGGTACCT 


TGCTTTAATC 


2520 


CTAAATGGCA 


AACTCCTTCT 


TTTCCGGACA 


TTCATTTGCA 


GGAGGACATT 


CTTGATAGAT 


2580 


GTAAGCAATT 


TGTGGGGCCC 


CTTACAGTAA 


ATGAAAACAG 


GAG AC T AAAA 


TTAATTATGC 


2640 


CTGCTAGGTT 


TTATCCAAAT 


GTTACTAAAT 


ATTTGCCCTT 


AGATAAAGGG 


ATC AAAC CAT 


2700 


ATTATCCAGA 


GTATGTAGTT 


AATCATTACT 


TCCAGACGCG 


ACATTATTTA 


CACACTCTTT 


2760 


GGAAGGCGGG 


GATCTTATAT 


AAAAGAGAGT 


CCACACGTAG 


CGCCTCATTT 


TGCGGGTCAC 


2820 


CATATTCTTG 


GGAACAAGAT 


CTACAGCATG 


GGAGGTTGGT 


CTTCCAAACC 


TCGAAAAGGC 


2880 


ATGGGGACAA 


ATCTTTCTGT 


CCCCAATCCC 


CTGGGATTCT 


TCCCCGATCA 


TCAGTTGGAC 


2940 


CCTGCATTCA 


AAGCCAACTC 


AGAAAATCCA 


G ATTGGG AC C 


TCAACCCGCA 


CAAGGACAAC 


3000 


TGGCCGGACG 


CCAACAAGGT 


GGGAGTGGGA 


GCATTCGGGC 


CAGGGTTCAC 


CCCTCCTCAT 


3060 


GGGGGACTGT 


TGGGGTGGAG 


CCCTCAGGCT 


CAGGGCCTAC 


TCACAACTGT 


GCCAGCAGCT 


3120 


CCTCCTCCTG 


CCTCCACCAA 


TCGGCAGTCA 


GGAAGGCAGC 


CTACTCCCTT 


ATCTCCACCT 


3180 


CTAAGGGACA 


CTCATCCTCA 


GGCCATGCAG 


TGGAA 






3215 



(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 843 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

Met Pro Leu Ser Tyr Gin His Phe Arg Lys Leu Leu Leu Leu Asp Glu 
15 10 15 

Glu Ala Gly Pro Leu Glu Glu Glu Leu Pro Arg Leu Ala Asp Glu Gly 
20 25 30 

Leu Asn Arg Arg Val Ala Glu Asp Leu Asn Leu Gly Asn Leu Asn Val 
35 40 45 

Ser lie Pro Trp Thr His Lys Val Gly Asn Phe Thr Gly Leu Tyr Ser 
50 55 60 

Ser Thr Val Pro Cys Phe Asn Pro Lys Trp Gin Thr Pro Ser Phe Pro 
65 70 75 80 

Asp lie His Leu Gin Glu Asp lie Leu Asp Arg Cys Lys Gin Phe Val 
85 90 95 

Glu Pro Leu Thr Val Asn Glu Asn Arg Arg Leu Lys Leu lie Met Pro 
100 105 110 



Ala Arg Phe Tyr Pro Asn Val Thr Lys Tyr Leu Pro Leu Asp Lys Gly 
115 120 125 

lie Lys Pro Tyr Tyr Pro Glu Tyr Val Val Asn His Tyr Phe Gin Thr 
130 135 140 

Arg His Tyr Leu His Thr Leu Trp Lys Ala Gly lie Leu Tyr Lys Arg 
145 150 155 160 

Glu Ser Thr Arg Ser Ala Ser Phe Cys Gly Ser Pro Tyr Ser Trp Glu 
165 170 175 

Gin Asp Leu Gin His Gly Arg Leu Val Phe Gin Thr Ser Lys Arg His 
180 185 190 

Gly Asp Lys Ser Phe Cys Pro Glu Ser Pro Gly lie Leu Pro Arg Ser 
195 200 205 

Ser Val Gly Pro Cys lie Gin Ser Gin Leu Arg Lys Ser Arg Leu Gly 
210 215 220 

Pro Gin Pro Ala Gin Gly Gin Leu Ala Gly Arg Gin Gin Gly Gly Ser 
225 230 235 240 

Gly Ser lie Arg Ala Arg Val His Pro Ser Ser Trp Gly Thr Val Gly 
245 250 255 

Val Glu Pro Ser Gly Ser Gly Pro Thr His Asn Cys Ala Ser Ser Ser 
260 265 270 

Ser Ser Cys Leu His Gin Ser Ala Val Arg Lys Ala Ala Tyr Ser Leu 
275 280 285 

lie Ser Thr Ser Lys Gly His Ser Ser Ser Gly His Ala Val Glu Leu 
290 295 300 

His His Phe Pro Pro Asn Ser Ser Arg Ser Gin Ser Gin Gly Pro Val 
305 310 315 320 

Leu Ser Cys Trp Trp Leu Gin Phe Arg Asn Ser Glu Pro Cys Ser Glu 
325 330 335 

Tyr Cys Leu Cys His lie Val Asn Leu lie Glu Asp Trp Gly Pro Cys 
340 345 350 

Thr Glu His Gly Glu His Arg lie Arg Thr Pro Arg Thr Pro Ala Arg 
355 360 365 

Val Thr Gly Gly Val Phe Leu Val Asp Lys Asn Pro His Asn Thr Ala 
370 375 380 

Glu Ser Arg Leu Val Val Asp Phe Ser Gin Phe Ser Arg Gly Asn Thr 
385 390 395 400 

Arg Val Ser Trp Pro Lys Phe Ala Val Pro Asn Leu Gin Ser Leu Thr 
405 410 415 

Asn Leu Leu Ser Ser Asn Leu Ser Trp Leu Ser Leu Asp Val Ser Ala 
420 425 430 

Ala Phe Tyr His Leu Pro Leu His Pro Ala Ala Met Pro His Leu Leu 
435 440 445 

Val Gly Ser Ser Gly Leu Ser Arg Tyr Val Ala Arg Leu Ser Ser Asn 
450 455 460 



Ser Arg lie Asn 
465 



Ser Cys Ser Arg 



Tyr Gly Gin Lys 
500 



Arg Lys lie Pro 
515 



Phe Thr Ser Ala 
530 

Leu Ala Phe Ser 
545 

Gin His Leu Glu 



Leu Gly lie His 
580 

Leu Asn Phe Met 
595 



Glu His lie Val 
610 

Asn Arg Pro lie 
625 

Gly Phe Ala Ala 



Leu Tyr Ala Cys 
660 

Tyr Lys Thr Phe 
675 

Arg Gin Arg Pro 
690 

Gly Trp Gly Leu 
705 



Ser Pro Leu Pro 



Arg Ser Arg Ser 
740 



Leu Ser Arg Lys 
755 



Asn Trp lie Leu 
770 

Asn Pro Ala Asp 
785 

Leu Leu Arg Leu 



Asn Asn 
470 

Asn Leu 
485 

Leu His 

Met Gly 

lie Cys 

Tyr Met 
550 

Ser Leu 
565 

Leu Asn 
Gly Tyr 

Gin Lys 

Asp Trp 
630 

Pro Phe 
645 

lie Gin 

Leu Ser 

Gly Leu 

Ala lie 
710 

lie His 
725 

Gly Ala 

Tyr Thr 

Arg Gly 

Asp Pro 
790 

Leu Tyr 
805 



Glu His 

Tyr Val 

Leu Tyr 

Val Gly 
520 

Ser Val 
535 

Asp Asp 

Tyr Ala 

Pro His 

Val lie 
600 

lie Lys 
615 

Lys Val 

Thr Gin 

Ala Lys 

Lys Gin 
680 

Cys Glu 
695 

Gly His 

Thr Ala 

Lys Leu 

Ser Phe 
760 

Thr Ser 
775 



Ser Arg 



Arg Pro 



Arg Thr 

Ser Leu 
490 

Ser His 
505 

Leu Ser 

Val Arg 

Val Val 

Ala Val 
570 

Lys Thr 
585 

Gly Ser 

Met Cys 

Cys Gin 

Cys Gly 
650 

Gin Ala 
665 

Tyr Leu 

Val Phe 

Gin Arg 

Glu Leu 
730 

He Gly 
745 

Pro Trp 

Phe Val 

Gly Arg 

Thr Thr 
810 



Met Glu 
475 

Met Leu 

Pro He 

Pro Phe 

Arg Ala 
540 

Leu Gly 
555 

Thr Asn 
Lys Arg 
Trp Gly 

Phe Arg 
620 

Arg He 
635 

Tyr Pro 



Phe Thr 



Asn Leu 

Ala Asp 
700 

Met Arg 
715 

Leu Ala 

Thr Asp 

Leu Leu 

Tyr Val 
780 

Leu Gly 
795 

Gly Arg 



Asn Leu 

Leu Tyr 

He Leu 
• 510 

Leu Leu 
525 

Phe Pro 

Ala Lys 

Phe Leu 

Trp Gly 
590 

Thr Leu 
605 

Lys Leu 

Val Gly 

Ala Leu 

Phe Ser 
670 

Tyr Pro 
685 

Ala Thr 

Gly Thr 

Ala Cys 

Asn Ser 
750 

Gly Cys 
765 

Pro Ser 
Leu Tyr 



Thr Ser 



His Asn 
480 

Lys Thr 
495 

Gly Phe 

Ala Gin 

His Cys 

Ser Val 
560 

Leu Ser 
575 

Tyr Ser 
Pro Gin 

Pro Val 

Leu Leu 
640 

Met Pro 
655 

Gin Thr 
Val Ala 



Pro Thr 

Phe Val 
720 

Phe Ala 
735 

Val Val 

Ala Ala 

Ala Leu 

Arg Pro 
800 

Leu Tyr 
815 



Ala Asp Ser Pro Ser Val Pro Ser His Leu Pro Asp Arg Val His Phe 
820 825 830 



Ala Ser Pro Leu His Val Ala Trp Arg Pro Pro 
835 840 



INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 400 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

Met Gly Gly Trp Ser Ser Lys Pro Arg Lys Gly Met Gly Thr Asn Leu 
15 10 15 

Ser Val Pro Asn Pro Leu Gly Phe Phe Pro Asp His Gin Leu Asp Pro 
20 25 30 

Ala Phe Lys Ala Asn Ser Glu Asn Pro Asp Trp Asp Leu Asn Pro His 
35 40 45 

Lys Asp Asn Trp Pro Asp Ala Asn Lys Val Gly Val Gly Ala Phe Gly 
50 55 60 

Pro Gly Phe Thr Pro Pro His Gly Gly Leu Leu Gly Trp Ser Pro Gin 
65 70 75 80 

Ala Gin Gly Leu Leu Thr Thr Val Pro Ala Ala Pro Pro Pro Ala Ser 
85 90 95 

Thr Asn Arg Gin Ser Gly Arg Gin Pro Thr Pro Leu Ser Pro Pro Leu 
100 105 110 

Arg Asp Thr His Pro Gin Ala Met Gin Trp Asn Ser Thr Thr Phe His 
115 120 * 125 

Gin Thr Leu Gin Asp Pro Arg Val Arg Ala Leu Tyr Phe Pro Ala Gly 
130 135 140 

Gly Ser Ser Ser Gly Thr Val Ser Pro Ala Gin Asn Thr Val Ser Ala 
145 150 155 160 

lie Ser Ser lie Leu Ser Lys Thr Gly Asp Pro Val Pro Asn Met Glu 
165 170 175 

Asn lie Ala Ser Gly Leu Leu Gly Pro Leu Leu Val Leu Gin Ala Gly 
180 185 190 

Phe Phe Leu Leu Thr Lys lie Leu Thr lie Pro Gin Ser Leu Asp Ser 
195 200 205 

Trp Trp Thr Ser Leu Asn Phe Leu Gly Gly Thr Pro Val Cys Leu Gly 
210 215 220 

Gin Asn Ser Gin Ser Gin lie Ser Ser His Ser Pro Thr Cys Cys Pro 
225 230 235 240 



Pro lie Cys Pro Gly Tyr Arg Trp Met Cys Leu Arg Arg Phe lie lie 
245 250 255 

Phe Leu Cys lie Leu Leu Leu Cys Leu lie Phe Leu Leu Val Leu Leu 
260 265 270 

Asp Tyr Gin Gly Met Leu Pro Val Cys Pro Leu lie Pro Gly Ser Thr 
275 280 285 

Thr Thr Ser Thr Gly Pro Cys Lys Thr Cys Thr Thr Pro Ala Gin Gly 
290 295 300 

Thr Ser Met Phe Pro Ser Cys Cys Cys Thr Lys Pro Thr Asp Arg Asn 
305 310 315 320 

Cys Thr Cys lie Pro lie Pro Ser Ser Trp Ala Phe Ala Lys Tyr Leu 
325 330 335 

Trp Glu Trp Ala Ser Val Arg Phe Ser Trp Leu Ser Leu Leu Val Pro 
340 345 350 

Phe Val Gin Trp Phe Val Gly Leu Ser Pro Thr Val Trp Leu Ser Val 
355 360 365 

He Trp Met Met Trp Phe Trp Gly Pro Ser Leu Tyr Asn He Leu Ser 
370 375 380 

Pro Phe Met Pro Leu Leu Pro He Phe Phe Cys Leu Trp Val Tyr He 
385 390 395 400 



INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 212 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 

Met Gin Leu Phe His Leu Cys Leu He He Ser Cys Ser Cys Pro Thr 
15 10 15 

Val Gin Ala Ser Lys Leu Cys Leu Gly Trp Leu Trp Gly Met Asp He 
20 25 30 

Asp Pro Tyr Lys Glu Phe Gly Ala Ser Val Glu Leu Leu Ser Phe Leu 
35 40 45 

Pro Ser Asp Phe Phe Pro Ser He Arg Asp Leu Leu Asp Thr Ala Ser 
50 55 60 

Ala Leu Tyr Arg Glu Ala Leu Glu Ser Pro Glu His Cys Ser Pro His 
65 70 75 80 

His Thr Ala Leu Arg Gin Ala He Leu Ser Trp Gly Glu Leu Met Asn 
85 90 95 

Leu Ala Thr Trp Val Gly Ser Asn Leu Glu Asp Pro Ala Ser Arg Glu 
100 105 110 

Leu Val Val Ser Tyr Val Asn Val Asn Met Gly Leu Lys He Arg Gin 
115 ^ 120 125 



Leu Leu Trp Phe His lie Ser Cys Leu Thr Phe Gly Arg Glu Thr Val 
130 135 140 

Leu Glu Tyr Leu Val Ser Phe Gly^Val Trp lie Arg Thr Pro Pro Ala 
145 150 155 160 

Tyr Arg Pro Pro Asn Ala Pro lie Leu Ser Thr Leu Pro Glu Thr Thr 
165 170 175 

Val Val Arg Arg Arg Gly Arg Ser Pro Arg Arg Arg Thr Pro Ser Pro 
180 185 190 

Arg Arg Arg Arg Ser Gin Ser Pro Arg Arg Arg Arg Ser Gin Ser Arg 
195 200 205 

Glu Ser Gin Cys 
210 

(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 154 amino acids 

(B) TYPE: amino acid 

<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 

Met Ala Ala Arg Leu Cys Cys Gin Leu Asp Pro Ala Arg Asp Val Leu 
15 10 15 

Cys Leu Arg Pro Val Gly Ala Glu Ser Arg Gly Arg Pro Leu Pro Gly 
20 25 30 

Pro Leu Gly Ala Leu Pro Pro Ala Ser Pro Pro Val lie Pro Thr Asp 
35 40 45 

His Gly Ala His Leu Ser Leu Arg Gly Leu Pro Val Cys Ala Phe Ser 
50 55 60 

Ser Ala Gly Pro Cys Ala Leu Arg Phe Thr Ser Ala Arg Arg Met Glu 
65 70 75 80 

T h r Thr Val Asn Ala His Gly Asn Leu Pro Lys Val Leu His Lys Arg 
85 90 95 

Thr Leu Gly Leu Ser Ala Met Ser Thr Thr Asp Leu Glu Ala Tyr Phe 
100 105 110 

Lys Asp Cys Val Phe Asn Glu Trp Glu Glu Leu Gly Glu Glu Val Arg 
115 120 125 

Leu Lys Val Phe Val Leu Gly Gly Cys Arg His Lys Leu' Val Cys Ser 
130 135 140 

Pro Ser Pro Cys Asn Phe Phe Thr Ser Ala 
145 150 



(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 



(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 



{xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 
ATAAGCTTAT GCCCCTATCT TATCAACACT TCCGGA 



(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 
GAGTCTAGAC TCTGCGGTAT TGTGA 



(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 
GAGTCTAGAC TC GTGGTGG A CTTCT 



(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 
TGAGAATTCT CACGGTGGTC TCCATGCGAC GT 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10 
TTTGTTTACG TCCCGT 



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 



(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 27 
CTAAGCTTAG TTTCCGGAAG TGTTGAT 



